# Low VRAM Optimization

Hidream I1
Other
A ControlNet PEFT LoRA model based on HiDream-I1-Full, supporting text-to-image and image-to-image conversion
Image Generation
ControlNetLoRA
605
0
Mochi Lora
Apache-2.0
A LoRA fine-tuned version based on the Mochi-1 preview model, focusing on text-to-video generation tasks
Text-to-Video
weathon
112
1
GLM4 32B Neon V2
MIT
A roleplay fine-tuned version based on GLM-4-32B-0414, with excellent performance, distinctive personality, diverse styles, and elegant writing.
Large Language Model Transformers English
allura-org
171
7
Orpheus Awq
Apache-2.0
The 4-bit AWQ quantized version of Orpheus-3b FT, optimized for text-to-speech tasks and supporting voice cloning functionality.
Speech Synthesis English
YaTharThShaRma999
48
3
Deepseek V3 0324 GGUF UD
MIT
A dynamically quantized GGUF version of DeepSeek-V3-0324 provided by Unsloth, supporting inference frameworks such as llama.cpp and LM Studio.
Large Language Model English
unsloth
6,270
6
Deepseek V3 0324 GGUF
MIT
A GGUF quantization of DeepSeek-V3-0324, described as the best-performing quantized version in its size class; it significantly reduces file size while keeping quality close to Q8_0.
Large Language Model Other
ubergarm
1,712
20
SDXL GGUF
MIT
GGUF-format quantized version of Stable Diffusion XL, offering different quantization levels to accommodate various hardware configurations.
Text-to-Image
HyperX-Sentience
2,189
5
Qwenfluxprompt
Apache-2.0
This is a LoRA trained for the Wan2.1 14B video generation model, suitable for text-to-video and image-to-video tasks.
Video Processing Supports Multiple Languages
mam33
25
0
Cat Text To Video 2.3b
Apache-2.0
A text-to-video model based on conditional enhancement, extending generated segments and achieving smooth transitions through temporal condition transformers, supporting prompt interpolation functionality
Text-to-Video English
motexture
25
1
Minicpm O 2 6 Int4
The int4 quantized version of MiniCPM-o 2.6, significantly reducing GPU VRAM usage while supporting multimodal processing capabilities.
Text-to-Audio Transformers Other
openbmb
4,249
42
Shu Qi
FLUX.1-dev is a text-to-image generation model that supports LoRA fine-tuning and is suitable for creative image generation tasks.
Image Generation
Jonny001
425
2
Illustrious
Apache-2.0
The Illustrious model is a text-to-image AI model capable of generating high-quality images from text descriptions.
Text-to-Image English
calcuis
3,975
9
Controlnet Kohaku Canny Sdxl Fp16
A ControlNet model based on Stable Diffusion XL, specializing in precise image generation control through Canny edge detection
Image Generation
r3gm
19
0
Hunyuanvideo Gguf
Other
GGUF quantized version of Tencent's HunyuanVideo model, designed specifically for ComfyUI for text-to-video generation tasks
Text-to-Video
city96
6,142
162
FLUX.1 Fill Dev GGUF
Other
FLUX.1-Fill-dev is a text-to-image generation model based on FLUX technology, specializing in image inpainting tasks.
Text-to-Image English
second-state
691
3
Aria Sequential Mlp Bnb Nf4
Apache-2.0
A BitsAndBytes NF4 quantized version based on Aria-sequential_mlp, suitable for image-to-text tasks with approximately 15.5 GB VRAM requirement.
Image-to-Text Transformers
leon-se
76
11
Flux.1 Lite 8B Alpha
Other
Flux.1 Lite is an 8B-parameter Transformer model distilled from the FLUX.1-dev model, maintaining the same precision (bfloat16) while reducing memory usage by 7GB and improving runtime speed by 23%.
Text-to-Image
Freepik
1,810
415
Seba Ai
MIT
A video generation model based on CogVideoX-5b, capable of producing high-quality video content from text descriptions
Text-to-Video English
GlitchXRiot
13
2
Cogvideox 2b
Apache-2.0
CogVideoX is the open-source version of the video generation model from Qingying. The 2B version is an entry-level model that balances compatibility with low operational and development costs.
Text-to-Video English
rttrsabc
22
1
Chromafur Alpha Gguf
Other
ChromaFur Alpha is a text-to-image generation model converted to GGUF format, suitable for low-end GPUs or users who prefer fast loading.
Image Generation
WWizrd
13
1
Cogvideox 2b
Apache-2.0
CogVideoX is an open-source video generation model originating from Qingying. The 2B version is an entry-level model, balancing compatibility with low operational and development costs.
Text-to-Video English
THUDM
40.55k
324
Herobophades 3x7B
Apache-2.0
HeroBophades-3x7B is an experimental Mixture of Experts (MoE) LLM built using mergekit, designed to run in 4-bit mode on GPUs with 12GB VRAM.
Large Language Model Transformers
nbeerbower
20
3
Erosumika 7B V3 7.1bpw Exl2
Erosumika-7B-v3 is a 7.1bpw exl2 quantized language model suitable for running 16k context on GPUs with 8GB VRAM. It was created by merging multiple models using the DARE TIES method, primarily for entertainment-oriented fictional writing.
Large Language Model Transformers English
Natkituwu
24
1
Mangaka
Other
A Stable Diffusion model specifically designed for generating anime/manga storyboards
Image Generation Other
parsee-mizuhashi
472
5
Animatediff Motion Adapter V1 5 3
AnimateDiff is a technology that leverages existing Stable Diffusion text-to-image models to create videos by inserting motion module layers to achieve coherent motion between image frames.
Video Processing
guoyww
800
8
Show 1 Sr2
Show-1 is an efficient text-to-video generation model that combines the advantages of pixel and latent space diffusion models, capable of producing high-quality videos with precise text alignment.
Video Processing
showlab
127
10
Show 1 Sr1
Show-1 is an efficient text-to-video generation model that combines the strengths of pixel and latent space diffusion models to produce high-quality videos closely aligned with text prompts.
Video Processing
showlab
128
3
Sygil Diffusion
A fine-tuned version based on Stable Diffusion, supporting multi-level namespace control for image generation elements, effectively avoiding context confusion issues
Image Generation Supports Multiple Languages
Sygil
1,578
41
Colorjizz 512px
Openrail
A 512px color style model based on Stable Diffusion 1.5, trained on 130 images and activated by the prompt 'colorjizz' to generate vibrant color effects
Image Generation
plasmo
14
5
© 2025 AIbase